Final project for the 22100 R for Bio Data Science course at DTU. The project has been carried out by the students:
The dataset used in the projects consists of miRNA expression data for 76 esophageal cancer patients from the US and Japan. It was generated and published by Mathé et al. and we retireved from the GEO website.
Here we aim to reproduce the findings of the authors and elaborate on their visualizations.
The data processing pipeline is summarised in the flowchart:
Even though we followed the steps described by Mathé et al., we have found many more differentially expressed miRNAs according to the same criteria. By plotting the differentially expressed miRNAs, we corroborate that the miRNAs that we have found are, in fact, more differentially expressed than the ones found by Mathé et al..